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Abstract 

Motivated by Leinster-Cobbold measures of biodiver- 
sity, the notion of the spread of a finite metric space 
is introduced. This is related to Leinster's magnitude 
of a metric space. Spread is generalized to infinite 
metric spaces equipped with a measure and is calcu- 
lated for spheres and straight lines. For Riemannian 
manifolds the spread is related to the volume and 
total scalar curvature. A notion of scale-dependent 
dimension is introduced and seen, numerically, to be 
close to the Hausdorff dimension for approximations 
to certain fractals. 




1 Connection to Leinster-Cobbold diversity |2] 

2 Comparison with magnitude |4] 

3 Generalization to non-finite metric spaces [7] 

4 Dimension and fractals H] 

Introduction 

Given a finite metric space X with metric d we define 
the spread E (X) by 

E o(X) : = E r P -d(x,x')- 
xex L e v ' 
x'ex 

This is one of a family of metric space 'sizes' as 
we will see in Section Q] where we will also see the 
motivation for this definition. 

This generalizes easily from finite metric spaces to 
arbitrary metric spaces with a measure, as will be 
seen in Section |3] 



Figure 1: Spread profile of the three-point space R. 



One of the things that we will be interested in 
is how this measure of size alters as the metric is 
scaled, so we need to define some notation. For t > 
let tX denote the metric space X with the metric d 
scaled up by a factor of f, so that the distance in fX 
between x and x' is td(x,x'). We can consider the 
spread profile of the space X which is just the graph 
of E (fX) for t > 0. 

An example of a profile is given in Figure [TJ We 
consider the space tR, for t > 0, having three points, 
two of which are a distance f apart and are both 
a distance 1000£ from the third point. This family 
of metric spaces can be thought of as having three 
'regimes': where f is very small and there looks likes 
there is one point; where f is smallish and there looks 
like there are two points; and where t is very large 
and it looks like there are three points. This way 
of thinking is reflected in the values plotted. So we 
wish to think of the spread E as akin to an 'effective 
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number of points'. 

The basic properties of spread can now be given. 

Theorem 1. For X a finite metric space with N points, 
the spread has the following properties: 

• 1 < E (X) < N; 

• Eg(fX) is increasing in t; 

• E {tX) -> 1 as t -> 0; 

• E (fX) -> N as t -> oo; 

• E (X) < e diam(X)^ 

These results all follow easily from the definition. 

In this paper we consider further properties of 
the spread which are summarized in the following 
synopsis. 

Synopsis 

In the first section we recall Leinster and Cobbold's 
diversity measures |4J and show how that gives rise 
to the spread E of a metric space as the order-zero 
diversity of the metric space equipped with the uni- 
form probability distribution. This means that the 
spread can be thought of as an analogue of the num- 
ber of species in an ecosystem. We all see that there 
is a spread E (; of order q for all < q < oo and relate 
these to generalized means. 

In the second section we compare the spread with 
Leinster's notion [3J of the magnitude of a metric 
space. We show that if the space has a positive 
definite 'similarity matrix' then the magnitude is an 
upper bound for the spread, and if the space is homo- 
geneous then the magnitude is equal to the spread. 
We then go on to consider a space with no magni- 
tude, or rather, a space whose magnitude profile is 
discontinuous and see that the spread profile is very 
similar, but much better behaved. Finally in this sec- 
tion we see that two spaces with the same magnitude 
profile can have different spread profiles. 

In the third section we show that the definition 
of spread generalizes immediately to infinite metric 
spaces which are equipped with a measure. We 
calculate the spread of the straight line interval 
of length I with the usual Lebesgue measure and 
show that for large I the spread is approximately 
1/2 + In 2. Then we consider the spread of compact 



Riemannian manifolds, giving the spread of the n- 
sphere explicitly, and we show that asymptotically, 
as the manifold is scaled up, the leading order terms 
in the spread are determined by the volume and the 
total scalar curvature of the manifold. 

In the final section we consider the growth rate of 
the spread as the metric space is scaled. This gives a 
scale dependent notion of 'dimension'. For instance 
if millions of points are formed into the shape of a 
square, then at small scales it will look like a point, 
i.e. zero dimensional, at medium scales it will look 
two dimensional, and at very large scales it will look 
like a collection of isolated points, i.e. zero dimen- 
sional, again. We look at numerical calculations for 
some simple approximations to fractals and see that 
at the medium scales the 'spread dimension' is re- 
lated to the Hausdorff dimension of the fractal. 

1 Connection to Leinster-Cobbold 
diversity 

In this section we recall the notion of Leinster- 
Cobbold diversity measures and show how this gives 
rise to the spread of a metric space. We also see how 
generalized notions of spread relate to generalized 
means. 

1.1 Definition of the diversity measure 

In [4] Leinster and Cobbold defined certain 'diver- 
sity measures'. These are numbers associated to any 
finite set equipped with a probability distribution 
and a 'similarity matrix' — we will see that a met- 
ric on a finite set gives rise to a similarity matrix 
in a canonical way. These numbers are supposed 
to measure the biodiversity of a community where 
the points represent the different species, the simi- 
larity matrix represents the similarity between the 
species (a metric represents distance between the 
species) and the probability distribution represents 
the relative abundances of the species. 

Before defining the diversity measures we need 
to define the notion of a similarity matrix. If X is a 
finite set with N points {xj, . . . , x N } then a similar- 
ity matrix Z is an N x N matrix with < < 1 
and Z i; = 1. If Zjj = then this represents x,- and X: 
being completely dissimilar and if Z ( y = 1 then this 
represents x ; and x. being completely identical. A 
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metric d on X gives rise to a similarity matrix Z by 
setting Zjj := exp(—d(x i ,Xj)) / so that nearby points 
are considered very similar and far away points dis- 
similar. 

Given a finite set X with N points {x^, . . . , x N } 
equipped with a probability distribution p = 
{pi, . . . , Pn}, so that Yui Pi = 1/ an d a similarity ma- 
trix, Z, for q G [0,oo], define the Leinster-Cobbold 
diversity of order q by 



? D z (p) := 



i: Vl >0 



< n (z P )r Pi 

i:pi>0 



l-'/ 



mm 

!:Pi >0 (Zp); 



<?7^1, 



9 = 1, 



q 1 = 00. 



For fixed X, p and Z, the graph of ,? D z (p) against 
9 is known as the diversity profile. As a function of 
q, (? D z (p) is monotonically decreasing. We also have 
1 < ?D z (p) < N and the order q diversity can be 
thought of as an 'effective number of species'. 

The Leinster-Cobbold diversity measures general- 
ize a classic family of diversity measures known as 
the Hill numbers HJ. The Hill number of order q, 
^Hill (p) r for q G [0, oo] is defined for a finite set X 
with probability distribution p on it, so it requires 
no metric or similarity matrix. This Hill number can 
be obtained as the Leinster-Cobbold diversity of the 
identity similarity matrix, or, equivalenty, of the 'dis- 
crete' metric where all of the points are infinitely far 
apart from each other; so all species are considered 
to be completely dissimilar. Symbolically, we have 



'Hill( 



<?D 7 



The Hill numbers at the values q = 0, 1, 2, oo give, 
respectively, the following classical diversity mea- 
sures: the number of species, the exponential Shan- 
non index, the Simpson index and the reciprocal 
Berger-Parker diversity. 

We can think of this specialization to Hill numbers 
as using the Leinster-Cobbold diversity measure to 
get measures of a finite probability space by equip- 
ping the space with a canonical metric, namely the 
discrete metric. On the other hand we could use the 
Leinster-Cobbold diversity measures to get measures 
of the size of a finite metric space by equipping the 



space with a canonical probability distribution, namely 
the uniform distribution. This gives rise to the spread. 
Symbolically, for a metric space X with N points, de- 
fine the ^-spread E (? (X), for < q < oo, by 

E,(X,d):=*D«P(- d >((£ 1)). 

Explicit formulas are given in Table [T] By the mono- 
tonically decreasing nature of the Leinster-Cobbold 
diversity measures we have that q < q' implies that 
Eg(X) > E [? /(X). In this paper we have generally con- 
centrated on the greatest of the these values, E (X) 
which we just call the spread; this is the analogue of 
the 'number of species' in an ecosystem. 



E,(X) 




i 



l/N 



N 2 



U,j= 



1 %ij 



q^l,oo, 



0, 



q = l, 



2, 



mm 

i=l,...,N 



N 



E 1 - 



N 



1 Zij 



Table 1: Writing := exp(— d{x i ,Xj)), we have these 
explicit formulas and special cases of the ^-spread. 



1.2 Generalized means 
mean similarity 



and reciprocal 



Fundamental to the definition of the Leinster- 
Cobbold diversity measures is the idea of generalized 
mean |4J . Here we give a description of the ^-spread 
in those terms. 

Suppose that X is a finite metric space with N 



points {x 



l, ■ ■ 



}, then each point x, has a recip- 



rocal mean similarity denoted by p, and defined, as 
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the name suggests, as follows: 



Pi 



N 



-d(xi,Xj) 



We have 1 < < N and think of the reciprocal mean 
similarity as being a measure of how different the 
space is from the point x,, with being nearly 1 if 
all the points are close to x, and nearly N if all of the 
points are far from X(. 

In order to get a measure of the whole space we can 
take an average of these reciprocal mean similarities. 
There are many different averages we could take. For 
a set of numbers a := {^i, ■ ■ and a number 

s e EU {±00}, the s-mean y s (a) is defined, when 
s 7^0, ±00 as 



F s (a) 



N 



N 

£«! 

i=l 



1/s 



and as a limit when s = 0, ±00. This includes many 
standard means: }f° is the maximum, y 2 is the 
quadratic mean, y 1 is the arithmetic mean, y° is the 
geometric mean, is the harmonic mean, and y~°° 
is the minimum. These have various nice properties, 
but the interesting one to note here is that if Sj > s 2 
then }i s i (a) > y S2 (a) with equality if and only if all 
of the numbers in a are equal. 

FoiQ q E [0,oo], the ^-spread E ? (X) of the met- 
ric space X is by definition the (1 — q)-mean of the 
individual reciprocal mean similarities: 

E,(X):=^(p). 

We have 1 < E»(X) < N with EJX) being near to 1 
if all of the points are close to each other and E (X) 
being near to N if all of the points are far away from 
each other. 



2 Comparison with magnitude 

In this section we recall Leinster's notion of magni- 
tude and show how it relates to the spread. We look 
at examples of a metric space with no magnitude 
and two metric spaces with the same magnitude. 



The (j-spread can also be defined for negative q, but the prop- 
erties are slightly different and we do not consider that case here. 



2.1 Recap on magnitude 

Magnitude was introduced by Leinster in |3|. It is 
defined for 'most' metric spaces in the following way. 
For X a metric space a weighting on X consists of a 
weight w x 6 ]R for each x G X such that 



-d{x,y) 



1 for all y G X. 



If a weighting exist then |X| the magnitude of X is 
defined to be the sum of the weights: 



IXI 



UK 



xex 



If it exists then the magnitude is independent of any 
choice in the weighting. The definition of magnitude 
comes from enriched category theory, although it had 
previously appeared in the biodiversity literature [7J. 
For an example of a space without a magnitude see 
Section 2.3 below. There are large classes of spaces 
for which the magnitude is known to exist: one class 
of spaces on which it is defined is the class of 'posi- 
tive definite spaces'. A positive definite finite metric 
space is a finite metric space for which the similarity 
matrix Z is positive definite. Examples of positive 
definite spaces include subspaces of Euclidean space. 
One nice property of the magnitude of positive defi- 
nite spaces is the following. 

The Positive Definite Subset Bound (J3j Corol- 
lary 2.4.4]). If X is a positive definite space then \X\ 
is well defined, furthermore ifBcX then B is also posi- 
tive definite and \B\ < \X\. 

The magnitude is related to the Leinster-Cobbold 
diversity via the 'maximum diversity' |X| , . Be- 
fore defining that we say that a space X has a non- 
negative weighting if there is a weighting for X in 
which all of the weights are non-negative. The max- 
imum diversity is defined to be the maximum of 
the magnitudes of subsets of X with a non-negative 
weighting: 



IXI 



+ •" 



max 

BCX non-neg 



For instance, from the Positive Definite Subset Bound 
it follows that if X is positive definite with non- 
negative weighting then | X | + = | X | . The connection 
with diversity is given by the following theorem. 
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Leinster's Maximizing Theorem (|2 Theorem 3.1]). 

For X a metric space and for any q 6 [0, oo] the maximum 
value of the Leinster-Cobbold diversity of order q, over all 
probability distributions on X, is given by the maximum 
diversity: 



sup^D z (p) 
P 



IX 



+ ■ 



This explains the name. The maximum diversity 
is certainly in some sense much better behaved than 
the magnitude, however it is considerably harder to 
calculate in general. 

2.2 Comparing spread with magnitude 
We can now look at some basic comparisons. 
Theorem 2. Suppose that X is a finite metric space. 

1. The spread ofX is bounded-above by its maximum 
diversity: 

E (X) < |X| + . 

l.lfX is positive definite then its maximum diversity 
is bounded above by its magnitude, and thus so is its 
spread: 

E (X) < |X| + < |X|. 

Proof. 1 . This follows from immediately Leinster 's 
Maximizing Theorem and the interpretation of 
Eq(X) as the order-zero Leinster-Cobbold diver- 
sity of X with the uniform probability distribu- 
tion. 

2. By definition, the maximum diversity of X is the 
magnitude of a subset B of X, so by the Positive 
Definite Subset Bound, if X is positive definite 



then IXI 



B < X 



□ 



Note that the positive definite condition in part [2] 
of the above theorem cannot simply be removed as 
we will see in Section 2.3 that there is non-positive 
definite space with magnitude smaller than spread. 

We can show that the spread is actually equal to 
the magnitude in the special case of a homogeneous 
metric space. Recall that a homogeneous space is a 
space in which the points are all indistinguishable, 
or, more precisely, a homogeneous metric space is a 
space with a transitive action by a group of isome- 
tries. 



Theorem 3. If X is a homogeneous finite metric space 
then the magnitude and the spread coincide: 



E (X) 



IXI. 



More generally, the magnitude is equal to the q-spreadfor 
all q £ [0,oo]: 

E (? (X) = |X|. 

Proof. If X has N points, then the Speyer's For- 
mula 15 Theorem 1] for the magnitude of a homoge- 
neous space, we have for any x £ X that 



|X| 



N 



E„—d(x,x') ' 



On the other hand, every point in X has the same 
mean reciprocal similarity p, with 

N 



Ep— d(x,x' 



The ^-spread (X) is just the (1 — tj)-mean of the set 
of mean reciprocal similarities. But the (1 — (7 ) -mean 
of N copies of p is just p, thus 



E„(X) 



|X|, 



as required. 



□ 



There is slightly interesting notational coincidence 
when Z is invertible: 



N / N \ -1 

E (X) = E E Z /; 

/=i \/=i 



N N 



|X| 



i=l}=l 



2.3 A space with no magnitude 

Here we look at an example, given by Leinster in |3|, 
which has a discontinuity in its magnitude profile 
and look at its continuous spread profile. Con- 
sider the five-point space K 3 2 illustrated in Figure |2l 
equipped with the metric induced by the pictured 
graph, so that points on the same side are a dis- 
tance 2 from each other and points on opposite 
sides are a distance 1 from each other. As we scale 
this space, considering £K 3 2 for t > 0, we see that 
when t = ln(2 1 ^ 2 ) this has no magnitude. However, 
E (tK 5 2 ) is defined for all values of t > 0, and this 
seems to be a well-behaved version of the magni- 
tude. This example also shows that magnitude is not 
always an upper bound for the spread. 
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0.01 0.1 1 10 100 

Figure 2: A five-point space X 3 2 with the plot of the singularity in its magnitude profile together with its 
spread profile. 



It is straightforward to generate examples of metric 
spaces with many points on the magnitude profile 
not defined, for instance you can use maple to take a 
random graph with say 100 vertices and an expected 
valency of 10 at each vertex. 

We summarize here some of the good properties 
that the spread has when compared with the magni- 
tude. 

• The spread E is defined for all metric spaces. 

• As an N-point space is scaled up, the spread £ 
increases from 1 to N. 

• It is much easier to calculate E (X) than |X|. 
2.4 Trees with the same magnitude 

A further class of metric spaces to consider is that 
of trees. Given a tree, that is a graph with no cycles, 
we get a metric space consisting of the set of vertices 
and the edge-length. We can generalize an example 
of Leinster J3J Example 2.3.5] to show that all trees 
with the same number of vertices have the same 
magnitude. 

Theorem 4. Suppose that T N is a tree with N vertices 
for N > 1, then the magnitude function is given by 

_ N( e t -l)+2 

l fi Nl- -r^ ■ 



Proof. Observe that if N = 1 then 1^1=1 and the 
result holds. Suppose that N > 1. Pick a leaf v, i.e. a 
univalent vertex, of the tree T N . Let A be the metric 
space T N \ v and let B be the submetric space of T N 
consisting of v and its adjacent vertex. Then by J3j 
Corollary 2.3.3] we have 

\tT N \ = \tA\ + \tB\ - 1 = \tA\ + ^-A-j - 1 

= fA + _ -, 
e f + 1 

and as A is a tree with N — 1 vertices the result 
follows by induction. □ 

Let's look in particular at two extreme examples; 
these are pictured in Figure |3j On the one hand we 
have L N the linear tree with N vertices; on the other 
hand we have C N the corona with N vertices, that 
is the tree with one "central" vertex which has an 
edge to each of the other vertices, and there are no 
other edges. The corona can be thought of as 
the complete bipartite graph Both of these 

N-trees give positive-definite metric spaces. By the 
above theorem these two spaces have the same mag- 
nitude function. However, they have various dif- 
ferent properties. For instance, their diameters are 
distinct, diam(L N ) = N — 1 and diam(C N ) = 2; and 
tL N always has a positive weighting, whereas C N 
has a negative weight on the central point if N > 5 
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Figure 3: The linear tree and the corona with six 
points. 



(and £C 4 has a negative weight on the central point 
iff<ln(2)). 

The spread distinguishes these spaces. An easy 
calculation gives the following. 



3 Generalization to non-finite met- 
ric spaces 

The spread of a finite metric space was defined by us- 
ing the canonical uniform probability measure on the 
underlying finite set. The definition generalizes im- 
mediately to any metric space equipped with a finite 
mass measure. If (X, d) is a metric space equipped 
with a measure ]i such that }i(X) < oo, then we can 
define the spread of X by 



EnfX) 



dji(x) 



E (fC N ) 



N 

1=1 1 



-t(i-i) 



-t(N-i) 

N - 



1 



l + (N-l)e- f l + e- f + (N-2; 



,-2t 



This is really the spread with respect to the associated 
probability measure y/ y(X), but the two factors of 
y(X) cancel in the numerator and denominator. For 
Eg(X) with q > the total mass }i(X) makes an 
appearance in the definition. 

We can now look at the following examples: the 
line interval with the Lebesgue measure; the n-sphere 
with its intrinsic metric and standard measure; and, 
asymptotically any compact Riemannian manifold. 



It is not too hard to calculate the maximum diversity 
function either. As the linear tree is positive definite 
and has a positive weighting the maximum diversity 
is precisely the magnitude. The corona does not 
always have a positive weighting, and one finds that 
the central point needs to be 'switched off when 
the corona is scaled down sufficiently; this gives the Theorem 5. We have 
following maximum diversity function. 



3.1 The closed line interval 

We can quite straightforwardly calculate the spread 
of the length £ line interval L( equipped with the 
standard Lebesgue measure. 



\tc 



N{e f -l)+2 

N\+-\ JV_1 

1 + (N - 2)e~ 2t 



Eo(I* 



t > ln(N 
t < ln(N 



2) 
2) 



arctanh(\/l — e~^) 

' 



The linear tree and the corona with 10 points are 
compared in Figure [4] Whilst they have the same 
magnitude they clearly do not have the same maxi- 
mum diversity nor spread: the linear tree has greater 
spread than the corona. The magnitude of both 
spaces grows essentially linearly to start with (see 
Section Eb, which is not really the right thing for the 
corona. The spread (and maximum diversity) grows 
linearly for the linear tree and exponentially for the 
corona, reflecting the geometry of these spaces some- 
what more. 



and asymptotically, as i — > oo, 

E (L,)-(^/2 + ln(2))^0. 

Proof. This is just a case of calculating the integral. 
First observe that for x 6 [0,£], 



yeL e 



f ' 

Jy=0 



\ x -y\ 



dy 



e~ x+ ydy + 



-y+x 



dy 



\y=0 



y=x 



-(<-*) 
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Figure 4: Profiles for the linear tree and the corona with 10 points. The magnitude profile — the dotted 
line — is the same for both spaces. The exponential in the diameter bounds the maximum diversity and the 
spread. 



Thus 



E (^) 



dx 



x =o 2- (e~ x + e~ (*-*)) 
^ e~ x dx 
=0 2e- x - (e~ 2x +e~ e ) 
t e~ x dx 
=0 (1 - e~ e ) - (1 - e' x f 



arctanh 



1-e - 



VT 



x=0 



arctanh(Vl — e~ £ ) 



whence, as 1 — Vl — e~ e decays exponentially to 1, 

Eo(L/)-(^/2 + ln(2))^0 as £ -> oo 
as required. □ 

This result should be compared with the magni- 
tude for the interval of length i > 1 5 1 : 

\L t \ = 1/2 + 1. 

So asymptotically they have the same leading order 
term but different sub-leading terms. 

As an aside, we can easily calculate the integral the- 
oretic versions of E 2 and for the interval. Again, 



Now to consider the asymptotic behaviour as oo, asymptotically these have the the same leading order 



observe 



arctanh(z) = j In ( - 



1+z 



\ In 



(1 + z) 2 



Thus 

Eo(^) 



= ln(l +z) - \ ln(l -z 2 ). 
In(l + Vl -e- e ) - 1 ln(l - (1 - e~ 1 )) 



terms, but different sub-leading order terms. 

Theorem 6. For i > we have the following results for 
the length I interval. 

1. E 2 (L e ) 



ln(l + Vl -e~ e ) + 1 



Vl -e~ 

2 



2e-2(l-e- e )' 

2. E 2 (L £ ) - {1/2 + 1/2) -> as i -> oo. 

3 - E co(^) = 2 (l_g-</2)- 

4. ^(Li) - 1/2 -+ as I -too. 
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Proof. 1 . This is obtained from the integral version 
of the order two spread: 



(SxeL t Mx)Y 



L £ L e f ye L e e - d{XAj) My)dn(x) 
= f 

"/io(2-( e - + e -(^)))d F (x) 

_ I 2 

~ 21 - 2(1 - tr l ) ' 

2. This follows from the above. 

3. This is obtained from the integral version of the 
order-infinity spread: 

F (J ; - ■ f S y eL t My) 
°° (M "^/ yei/ -^)d F (y) 

inf 



xg[0/] 2- (e-- T + e- ( - i ~ x '>) 



□ 



2(l-e-^ 2 )' 
4. This follows from the above. 

3.2 Riemannian manifolds 



A Riemannian manifold is a smooth manifold 
equipped with a Riemannian metric, so in particular 
has an inner-product on each tangent space. This 
structure gives rise to both a metric and a measure 
on the manifold. The metric comes about because the 
Riemannian metric can be used to define a length for 
each rectifiable path in the manifold and the distance 
between two points is defined to be the infimum of 
the lengths of all the paths between the two points. 
The measure comes about because the Riemannian 
metric can be used to define a volume form which 
leads to a density and a measure. This means that ev- 
ery Riemannian manifold has a well-defined spread 
given by the formula 



Eo(X) 



dx 



In the case of homogeneous Riemannian manifolds 
this coincides with the formula for the magnitude 



that was examined in [9]. In particular this tells us 
that the spread of the n-sphere of radius R with 
its intrinsic metric, for n > 1 is given by 



E (S£) 



n/2 



7lR 



n((^: 

(n-l)/2 



n (od 2 +i 
i=i 



n even 



n odd. 



Moreover, the methods employed in f9[ to calculate 
the asymptotics carry over essentially unchanged but 
work for all closed Riemannian manifolds and not 
just homogeneous ones. 

Theorem l.lfX is an n-dimensional Riemannian man- 
ifold (without boundary), with co n denoting the volume 
of the unit n-ball, vol(X) denoting the volume ofX and 
tsc(X) denoting the total scalar curvature ofX then as X 
is scaled up the asymptotics of the spread are as follows: 



Eo(fX) 



n\ co„ 



t n vol(X) 



1 



n-2 



tsc(X) 



6 

+0(t"- 4 )) ast->oo. 



Proof. This is almost identical to the proof of Theo- 
rem 11 in |9] except that now the scalar curvature is 
not a constant and should be written as t(x). □ 

This simplifies in the case n = 2 as follows. 

Corollary. For E a Riemannian surface, the spread is 
asymptotically given in terms of the area and the Euler 
characteristic by 



area(X^ 
2tt 



t 2 + x(z) + o(r 2 ) ast-> 



Proof. This follows from the theorem above as co 2 = 
re and the Gauss-Bonnet Theorem says that tsc(E) = 

4^(2:). □ 



4 Dimension and fractals 

In this section we define the notion of spread dimen- 
sion of a metric space which is the instantaneous 
growth rate of the spread of the space. This notion of 
dimension is scale dependent. For instance, we will 
see that a long, thin rectangular array of points can 
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have spread dimension close to zero, one, or two, de- 
pending on the scale. Finally, we look at the spread 
dimension of some finite approximations to simple 
fractals and see that the spread dimension is close to 
the Hausdorff dimension at some scales. 



4.1 Definition of spread dimension 

Now that we have a notion of size of a metric space, 
we can look at the growth rate of this size as a mea- 
sure of the 'dimension' of the space. Typically one 
looks at the asymptotic growth rate as a measure of 
dimension, but it is interesting here to look at the 
instantaneous growth rate. The size is very scale 
dependent in a non-obvious way so looking at how 
the growth rate varies is very interesting. For a real- 
valued function / defined on some subset of the reals, 
we can define the growth rate at t by 



(G/)M 



dHf(t)) 
dln(f) ' 



For example, if /(f) = f then Gf(t) = n. Another 
way of writing this is as 



(Gf)(t):-- 



t df(t) 
fit) dt • 



The instantaneous growth rate is the gradient in a 
log-log plot of the function. 

We define dim (X) the instantaneous spread di- 
mension, or just spread dimension, of a metric 
space X to be G/(l) where f(t) : = E (fX), in other 
words, 



dim (X) :-- 



dln(E (fX)) 



dln(f) 
t dE (fX) 
E (fX) dt 



f=i 



It is then informative to look at examples of this in- 
stantaneous spread dimension as the space is scaled. 
The following examples were calculated using maple 
on a processor with 16GB of RAM. 

4.2 Rectangular grids 

As a first set of examples we can look at three types 
of rectangular grids with equally spaced points. The 
spread-dimension profiles are shown in Figure [5] 



Starting first with the grid of 1 x 30000 points, or, 
in other words, a line of 30000 points, we see that 
when the points are very close together the spread 
dimension is close to zero, reflecting the fact that the 
Tine' at that scale is point-like. As the line of points 
is scaled up, it looks more and more like a line, so 
when the interpoint distance is 0.01 units, meaning 
the length is 300 units, the spread dimension is close 
to one. As the line of points is scaled up further and 
further, so that the interpoint distance is 10 units, 
say, the point-like nature is apparent and the spread 
dimension drops to zero. 

Considering the square grid of 220 x 220 points, 
we see that this starts off looking like a point at small 
scales, with the spread dimension being close to zero, 
then as the square grid is scaled up to about 20 units 
by 20 units, with an interpoint distance of about 0.1 
units, it looks more like a genuine square and has 
an spread dimension of just under two. Then as the 
square grid is scaled up further, the point-like nature 
is apparent and the spread dimension drops to zero. 

The most interesting case shown is where we con- 
sider the rectangular grid of 10 x 4900 points. Again, 
at small scales the spread dimension is close to zero 
whilst the grid looks like a small point. Then as it is 
scaled up there is a regime, around where the rect- 
angle is of the order of 0.1 units by 50 units, where 
the space looks 'line-like' and the dimension is ap- 
proximately one. As it is scaled up further to around 
10 units by 500 units, the width is apparent and the 
spread dimension heads towards two. Finally, as it 
is scaled up further, the point-like nature becomes 
apparent and the spread dimension descends to zero. 

From this we deduce that the spread E is clearly 
measuring something geometric. 

4.3 Fractals 

We now look at the spread dimension of certain 
finite approximations to fractal sets in Euclidean 
space, namely to the ternary Cantor set, the Koch 
curve and the Sierpinski triangle. We can look at 
the spread dimension profile and see that at certain 
scales the spread dimension is roughly the Hausdorff 
dimension of the corresponding fractal, indicating 
that spread is a reasonable measure of the size of 
these fractals, and, indeed, of these approximations 
to these fractals. 

In the first case we look at the ternary Cantor set. 
This is approximated by starting with two points 
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interpoint distance 



Figure 5: The spread dimension profiles for various rectangular grids. 



a distance I apart on a line. We use the two con- 
tractions of the line by a factor of a third which 
respectively leave the two points fixed. By applying 
these two contractions successively up to 10 times, 
starting at the initial points, we obtain 2048 points. 
The spread dimension at various lengths t can then 
be computed numerically. In Figure [6] we see that 
at small scales the spread dimension is close to zero 
corresponding to the fact that the space looks like a 
point at those scales. Similarly, at very large scales, 
the space looks like a collection of distant points and 
the spread dimension is again zero. At intermedi- 
ate scalings, roughly for 10 < i < 10000 the spread 
dimension is roughly the Hausdorff dimension of 
the Cantor set, namely In 2/ In 3, indicating that the 
space looks more 'Cantor set-like' at those scales. 

The top-right picture in Figure [6] is an enlargement 
of the Cantor set profile, and shows that things are 
apparently more intriguing than one might guess. 
At intermediate scales, the spread dimension seems 
to oscillate around the Hausdorff dimension, with 
the oscillations being of multiplicative period 3. Such 
small oscillations were observed for the magnitude 
of the Cantor set in (SJ- I have no good explanation 
for these oscillations at the moment. 

In the next case we look at the Koch curve. Again, 
this is approximated by starting with a couple of 
points and iteratively applying one of four contrac- 
tions, to obtain a finite metric space contained in 
the Koch curve. The graph shows that as the ap- 



proximation is scaled up from very small, the spread 
dimension increases to roughly the Hausdorff di- 
mension, In 4/ In 3, where it remains over a range of 
scales, before descending to zero as the approximat- 
ing space is scaled up sufficiently so that it's discrete, 
point-like nature is apparent. 

The final example of the Sierpinski triangle is gen- 
erated in the same way using an iterated function 
system, and shows the same behaviour, namely, of 
having roughly the same spread dimension as its 
Hausdorff dimension at certain scales. 

These examples should serve to show that there is 
something interesting going on which has yet to be 
examined fully. 
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Figure 6: The spread dimension profiles of finite approximations to certain fractals, compared to the 
Hausdorff dimensions of the fractals. 
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